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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Leu Leu Asn Val Leu Arg lie Cys He He Val Cys Leu Val Asn 
1 5 10 15 

Asp Gly Ala Gly Lys His Ser Glu Gly Arg Glu Arg Thr Lys Thr Tyr 
20 25 30 

Ser Leu Asn Ser Arg Gly Tyr Phe Arg Lys Glu Arg Gly Ala Arg Arg 
35 40 45 

Ser Lys He Leu Leu Val Asn Thr Lys Gly Leu Asp Glu Pro His He 
50 55 60 

Gly His Gly Asp Phe Gly Leu Val Ala Glu Leu Phe Asp Ser Thr Arg 
65 70 75 80 

Thr His Thr Asn Arg Lys Glu Pro Asp Met Asn Lys Val Lys Leu Phe 

85 90 95 

Ser Thr val Ala His Gly Asn Lys Ser Ala Arg Arg Lys Ala Tyr Asn 
100 105 HO 

Gly Ser Arg Arg Asn He Phe Ser Arg Arg Ser Phe Asp Lys Arg Asn 
115 120 125 

Thr Glu val Thr Glu Lys Pro Gly Ala Lys Met Phe Trp Asn Asn Phe 
130 135 140 

Leu val Lys Met Asn Gly Ala Pro Gin Asn Thr Ser His Gly Ser Lys 
145 150 155 160 

Ala Gin Glu He Met Lys Glu Ala Cys Lys Thr Leu Pro Phe Thr Gin 

165 170 175 

Asn He Val His Glu Asn Cys Asp Arg Met Val He Gin Asn Asn Leu 
180 185 190 

Cys Phe Gly Lys Cys He Ser Leu His val Pro Asn Gin Gin Asp Arg 
195 200 205 

Ara Asn Thr Cys Ser His Cys Leu Pro Ser Lys Phe Thr Leu Asn His 
* 210 215 220 
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Leu Thr Leu Asn Cys Thr Gly Ser Lys Asn Val Val Lys Val Val Met 
225 230 235 240 



Met val Glu Glu Cys Thr Cys Glu Ala His Lys Ser Asn Phe His Gin 

245 250 255 



Thr Ala Gin Phe Asn Met Asp Thr Ser Thr Thr Leu His His 
260 265 270 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



(Xi) SEQUENCE DESCRIPTION: 


SEQ ID NO: 2 


• 






GAATTCCTAA AAGCGGCACA GTGCAGGAAC 


AGCAAGTCGC 


TCAGAAACAC 


TGCAGGGTCT 


60 


AGATATCATA CAATGTTACT AAATGTACTC 


AGGATCTGTA 


TTATCGTCTG 


CCTTGTGAAT 


120 


GATGGAGCAG GAAAACACTC AGAAGGACGA 


GAAAGGACAA 


AAACATATTC 


ACTTAACAGC 


180 


AGAGGTTACT TCAGAAAAGA AAGAGGAGCA 


CGTAGGAGCA 


AGATTCTGCT 


GGTGAATACT 


240 


AAAGGTCTTG ATGAACCCCA CATTGGGCAT 


GGTGATTTTG 


GCTTAGTAGC 


TGAACTATTT 


300 


GATTCCACCA GAACACATAC AAACAGAAAA 


GAGCCAGACA 


TGAACAAAGT 


CAAGCTTTTC 


360 


TCAACAGTTG CCCATGGAAA CAAAAGTGCA 


AGAAGAAAAG 


CTTACAATGG 


TTCTAGAAGG 


420 


AATATTTTTT CTCGCCGTTC TTTTGATAAA 


AGAAATACAG 


AGGTTACTGA 


AAAGCCTGGT 


480 


GCCAAGATGT TCTGGAACAA TTTTTTGGTT 


AAAATGAATG 


GAGCCCCACA 


GAATACAAGC 


540 


CATGGCAGTA AAGCACAGGA AATAATGAAA 


GAAGCTTGCA 


AAACCTTGCC 


CTTCACTCAG 


600 


AATATTGTAC ATGAAAACTG TGACAGGATG 


GTGATACAGA 


ACAATCTGTG 


CTTTGGTAAA 


660 


TGCATCTCTC TCCATGTTCC AAATCAGCAA 


GATCGACGAA 


ATACTTGTTC 


CCATTGCTTG 


720 


CCGTCCAAAT TTACCCTGAA CCACCTGACG 


CTGAATTGTA 


CTGGATCTAA 


GAATGTAGTA 


780 


AAGGTTGTCA TGATGGTAGA GGAATGCACG 


TGTGAAGCTC 


ATAAGAGCAA 


CTTCCACCAA 


840 



[Page 3] 



ACTGCACAGT TTAACATGGA TACATCTACT ACCCTGCACC ATTAAAAGGA CTGTCTGCCA 900 

TACAGTATGG AAATGCCCAT TTGTTGGAAT ATTCGTTACA TGCTATGTAT CTAAAGCATT 960 

ATGTTGCCTT CTGTTTCATA TAACCACATG GAATAAGGAT TGTATGAATT ATAATTAACA 1020 

AATGGCATTT TGTGTAACAT GCAAGATCTC TGTTCCATCA GTTGCAAGAT AAAAGGCAAT 1080 

ATTTGTTTGA CTTTTTTCTA CAAAATGAAT ACCCAAATAT ATGATAAGAT AATGGGGTCA 1140 

AAACTGTTAA GGGGTAATGT AATAATAGGG ACTAACAACC AATCAGCAGG TATGATTTAC 1200 

TGGTCACCTG TTTAAAAGCA AACATCTTAT TGGTTGCTAT GGGTTACTGC TTCTGGGCAA 1260 

AATGTGTGCC TCATAGGGGG GTTAGTGTGT TGTGTACTGA ATTAATTGTA TTTATTTCAT 1320 

TGTTACAATG AAGAGGATGT CTATGTTTAT TTCACTTTTA TTAATGTACA ATAAATGTTC 1380 

TTGTTTCTTT AAAAAAAAAA AAAAACTCGA G 1411 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ser Arg Thr Arg Lys Val Asp Ser Leu Leu Leu Leu Ala lie Pro 
15 10 15 

Gly Leu Ala Leu Leu Leu Leu Pro Asn Ala Tyr Cys Ala Ser Cys Glu 
20 25 30 

Pro Val Arg lie Pro Met Cys Lys Ser Met Pro Trp Asn Met Thr Lys 
35 40 45 

Met Pro Asn His Leu His His Ser Thr Gin Ala Asn Ala lie Leu Ala 
50 55 60 

lie Glu Gin Phe Glu Gly Leu Leu Thr Thr Glu Cys Ser Gin Asp Leu 
65 70 75 80 

Leu Phe Phe Leu Cys Ala Met Tyr Ala Pro lie Cys Thr lie Asp Phe 

85 90 95 
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Gin His Glu Pro He Lys Pro Cys Lys Ser Val Cys Glu Arg Ala Arg 
100 105 110 

Ala Gly Cys Glu Pro He Leu He Lys Tyr Arg His Thr Trp Pro Glu 
115 120 125 

Ser Leu Ala Cys Glu Glu Leu Pro Val Tyr Asp Arg Gly Val Cys He 
130 135 140 

Ser Pro Glu Ala He Val Thr Val Glu Gin Gly Thr Asp Ser Met Pro 
145 150 155 160 

Asp Phe Ser Met Asp Ser Asn Asn Gly Asn Cys Gly Ser Gly Arg Glu 

165 170 175 

His Cys Lys Cys Lys Pro Met Lys Ala Thr Gin Lys Thr Tyr Leu Lys 
180 185 190 

Asn Asn Tyr Asn Tyr Val He Arg Ala Lys Val Lys Glu Val Lys Val 
195 200 205 

Lys Cys His Asp Ala Thr Ala He Val Glu Val Lys Glu He Leu Lys 
210 215 220 

Ser Ser Leu Val Asn He Pro Lys Asp Thr Val Thr Leu Tyr Thr Asn 
225 230 235 240 

Ser Gly Cys Leu Cys Pro Gin Leu val Ala Asn Glu Glu Tyr He He 

245 250 255 

Met Gly Tyr Glu Asp Lys Glu Arg Thr Arg Leu Leu Leu Val Glu Gly 
260 265 270 

Ser Leu Ala Glu Lys Trp Arg Asp Arg Leu Ala Lys Lys Val Lys Arg 
275 280 285 

Trp Asp Gin Lys Leu Arg Arg Pro Arg Lys Ser Lys Asp Pro val Ala 
290 295 300 



Pro He Pro Asn Lys Asn Ser Asn Ser Arg Gin Ala Arg Ser 
305 310 315 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1875 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: CDNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GAATTCCCTT TCACACAGGA CTCCTGGCAG AGGTGAATGG TTAGCCCTAT GGATTTGGTT 60 

TGTTGATTTT GACACATGAT TGATTGCTTT CAGATAGGAT TGAAGGACTT GGATTTTTAT 120 

CTAATTCTGC ACTTTTAAAT TATCTGAGTA ATTGTTCATT TTGTATTGGA TGGGACTAAA 180 

GATAAACTTA ACTCCTTGCT TTTGACTTGC CCATAAACTA TAAGGTGGGG TGAGTTGTAG 240 

TTGCTTTTAC ATGTGCCCAG ATTTTCCCTG TATTCCCTGT ATTCCCTCTA AAGTAAGCCT 300 

ACACATACAG GTTGGGCAGA ATAACAATGT CTCGAACAAG GAAAGTGGAC TCATTACTGC 360 

TACTGGCCAT ACCTGGACTG GCGCTTCTCT TATTACCCAA TGCTTACTGT GCTTCGTGTG 420 

AGCCTGTGCG GATCCCCATG TGCAAATCTA TGCCATGGAA CATGACCAAG ATGCCCAACC 480 

ATCTCCACCA CAGCACTCAA GCCAATGCCA TCCTGGCAAT TGAACAGTTT GAAGGTTTGC 540 

TGACCACTGA ATGTAGCCAG GACCTTTTGT TCTTTCTGTG TGCCATGTAT GCCCCCATTT 600 

GTACCATCGA TTTCCAGCAT GAACCAATTA AGCCTTGCAA GTCCGTGTGC GAAAGGGCCA 660 

GGGCCGGCTG TGAGCCCATT CTCATAAAGT ACCGGCACAC TTGGCCAGAG AGCCTGGCAT 720 

GTGAAGAGCT GCCCGTATAT GACAGAGGAG TCTGCATCTC CCCAGAGGCT ATCGTCACAG 780 
TGGAACAAGG AACAGATTCA ATGCCAGACT TCTCCATGGA TTCAAACAAT GGAAATTGCG 840 
GAAGCGGCAG GGAGCACTGT AAATGCAAGC CCATGAAGGC AACCCAAAAG ACGTATCTCA 900 
AGAATAATTA CAATTATGTA ATCAGAGCAA AAGTGAAAGA GGTGAAAGTG AAATGCCACG 960 

ACGCAACAGC AATTGTGGAA GTAAAGGAGA TTCTCAAGTC TTCCCTAGTG AACATTCCTA 1020 

AAGACACAGT GACACTGTAC ACCAACTCAG GCTGCTTGTG CCCCCAGCTT GTTGCCAATG 1080 

AGGAATACAT AATTATGGGC TATGAAGACA AAGAGCGTAC CAGGCTTCTA CTAGTGGAAG 1140 

GATCCTTGGC CGAAAAATGG AGAGATCGTC TTGCTAAGAA AGTCAAGCGC TGGGATCAAA 1200 

AGCTTCGACG TCCCAGGAAA AGCAAAGACC CCGTGGCTCC AATTCCCAAC AAAAACAGCA 1260 

ATTCCAGACA AGCGCGTAGT TAGACTAACG GAAAGGTGTA TGGAAACTCT ATGGACTTTG 1320 

AAACTAAGAT TTGCATTGTT GGAAGAGCAA AAAAGAAATT GCACTACAGC ACGTTATATT 1380 

CTATTGTTTA CTACAAGAAG CTGGTTTAGT TGATTGTAGT TCTCCTTTCC TTCTTTTTTT 1440 
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TTATAACTAT ATTTGCACGT GTTCCCAGGC AATTGTTTTA TTCAACTTCC AGTGACAGAG 1500 

CAGTGACTGA ATGTCTCAGC CTAAAGAAGC TCAATTCATT TCTGATCAAC TAATGGTGAC 1560 

AAGTGTTTGA TACTTGGGGA AAGTGAACTA ATTGCAATGG TAAATCAGAG AAAAGTTGAC 1620 

CAATGTTGCT TTTCCTGTAG ATGAACAAGT GAGAGATCAC ATTTAAATGA TGATCACTTT 1680 

CCATTTAATA CTTTCAGCAG TTTTAGTTAG ATGACATGTA GGATGCACCT AAATCTAAAT 1740 

ATTTTATCAT AAATGAAGAG CTGGTTTAGA CTGTATGGTC ACTGTTGGGA AGGTAAATGC 1800 

CTACTTTGTC AATTCTGTTT TAAAAATTGC CTAAATAAAT ATTAAGTCCT AAATAAAAAA 1860 
AAAAAAAAAA AAAAA 



1875 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 979 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Leu Leu Leu Phe Arg Ala He Pro Met Leu Leu Leu Gly Leu Met 

5 10 15 



1 



Val Leu Gin Thr Asp Cys Glu He Ala Gin Tyr Tyr He Asp Glu Glu 
20 25 30 

Glu Pro Pro Gly Thr Val He Ala Val Leu Ser Gin His Ser He Phe 
35 40 45 

Asn Thr Thr Asp He Pro Ala Thr Asn Phe Arg Leu Met Lys Gin Phe 
50 55 60 

Asn Asn Ser Leu He Gly Val Arg Glu Ser Asp Gly Gin Leu Ser lie 
65 70 75 80 

Met Glu Arg He Asp Arg Glu Gin He Cys Arg Gin Ser Leu His Cys 



85 



Asn Leu Ala Leu Asp val Val Ser Phe Ser Lys Gly His Phe Lys Leu 
100 105 110 
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Leu Asn Val Lys Val Glu Val Arg Asp He Asn Asp His Ser Pro His 
115 120 125 

Phe Pro Ser Glu He Met His Val Glu Val Ser Glu Ser Ser Ser Val 
130 135 140 

Gly Thr Arg He Pro Leu Glu He Ala He Asp Glu Asp Val Gly Ser 
145 150 155 160 

Asn Ser He Gin Asn Phe Gin He Ser Asn Asn Ser His Phe Ser He 

165 170 175 

Asp val Leu Thr Arg Ala Asp Gly Val Lys Tyr Ala Asp Leu val Leu 
180 185 190 

Met Arg Glu Leu Asp Arg Glu He Gin Pro Thr Tyr He Met Glu Leu 
195 200 205 



Leu Ala Met Asp Gly Gly Val Pro Ser Leu Ser Gly Thr Ala Val Val 
210 215 220 

Asn He Arg Val Leu Asp Phe Asn Asp Asn Ser Pro Val Phe Glu Arg 
225 230 235 240 

Ser Thr He Ala val Asp Leu Val Glu Asp Ala Pro Leu Gly Tyr Leu 

245 250 255 

Leu Leu Glu Leu His Ala Thr Asp Asp Asp Glu Gly Val Asn Gly Glu 
260 265 270 

He Val Tyr Gly Phe Ser Thr Leu Ala Ser Gin Glu Val Arg Gin Leu 
275 280 285 

Phe Lys He Asn Ser Arg Thr Gly Ser Val Thr Leu Glu Gly Gin val 
290 295 300 

Asp Phe Glu Thr Lys Gin Thr Tyr Glu Phe Glu val Gin Ala Gin Asp 
305 310 315 320 

Leu Gly Pro Asn Pro Leu Thr Ala Thr Cys Lys val Thr val His He 

325 330 335 

Leu Asp Val Asn Asp Asn Thr Pro Ala He Thr He Thr Pro Leu Thr 
340 345 350 

Thr Val Asn Ala Gly val Ala Tyr He Pro Glu Thr Ala Thr Lys Glu 
355 360 365 

Asn Phe He Ala Leu He Ser Thr Thr Asp Arg Ala Ser Gly Ser Asn 
370 375 380 
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Gly Gin Val Arg Cys Thr Leu Tyr Gly His Glu His Phe Lys Leu Gin 
385 390 395 400 

Gin Ala Tyr Glu Asp Ser Tyr Met lie Val Thr Thr Ser Thr Leu Asp 

405 410 415 

Arg Glu Asn lie Ala Ala Tyr Ser Leu Thr Val Val Ala Glu Asp Leu 
420 425 430 

Gly Phe Pro Ser Leu Lys Thr Lys Lys Tyr Tyr Thr Val Lys Val Ser 
435 440 445 

Asp Glu Asn Asp Asn Ala Pro Val Phe Ser Lys Pro Gin Tyr Glu Ala 
450 455 460 

Ser lie Leu Glu Asn Asn Ala Pro Gly Ser Tyr lie Thr Thr Val lie 
465 470 475 480 

Ala Arg Asp Ser Asp Ser Asp Gin Asn Gly Lys Val Asn Tyr Arg Leu 

485 490 495 

Val Asp Ala Lys Val Met Gly Gin Ser Leu Thr Thr Phe Val Ser Leu 
500 505 510 

Asp Ala Asp Ser Gly Val Leu Arg Ala Val Arg Ser Leu Asp Tyr Glu 
515 520 525 

Lys Leu Lys Gin Leu Asp Phe Glu lie Glu Ala Ala Asp Asn Gly lie 
530 535 540 

Pro Gin Leu Ser Thr Arg Val Gin Leu Asn Leu Arg lie Val Asp Gin 
545 550 555 560 

Asn Asp Asn Cys Pro Val lie Thr Asn Pro Leu Leu Asn Asn Gly Ser 

565 570 575 

Gly Glu Val Leu Leu Pro lie Ser Ala Pro Gin Asn Tyr Leu val Phe 
580 585 590 

Gin Leu Lys Ala Glu Asp Ser Asp Glu Gly His Asn Ser Gin Leu Phe 
595 600 605 

Tyr Thr lie Leu Arg Asp Pro Ser Arg Leu Phe Ala lie Asn Lys Glu 
610 615 620 

Ser Gly Glu Val Phe Leu Lys Lys Gin Leu Asn Ser Asp His Ser Glu 
625 630 635 640 

Asp Leu Ser lie Val Val Ala Val Tyr Asp Leu Gly Arg Pro Ser Leu 

645 650 655 
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Ser Thr Asn Ala Thr Val Lys Phe lie Leu Thr Asp Ser Phe Pro Ser 
660 665 670 

Asn Val Glu Val Val He Leu Gin Pro Ser Ala Glu Glu Gin His Gin 
675 680 685 

He Asp Met Ser He lie Phe He Ala Val Leu Ala Gly Gly Cys Ala 
690 695 700 

Leu Leu Leu Leu Ala He Phe Phe Val Ala Cys Thr Cys Lys Lys Lys 
705 710 715 720 

Ala Gly Glu Phe Lys Gin Val Pro Glu Gin His Gly Thr Cys Asn Glu 

725 730 735 



Glu Arg Leu Leu Ser Thr Pro Ser Pro Gin Ser Val Ser Ser Ser Leu 
740 745 750 

Ser Gin Ser Glu Ser Cys Gin Leu Ser He Asn Thr Glu Ser Glu Asn 
755 760 765 

Cys Ser Val Ser Ser Asn Gin Glu Gin His Gin Gin Thr Gly He Lys 
770 775 780 

His Ser He Ser Val Pro Ser Tyr His Thr Ser Gly Trp His Leu Asp 
785 790 795 800 

Asn Cys Ala Met Ser He Ser Gly His Ser His Met Gly His He Ser 

805 810 815 



U Thr Lys Asp Ser Gly Lys Gly Asp Ser Asp Phe Asn Asp Ser Asp Ser 
U 820 825 830 



Asp Thr Ser Gly Glu Ser Gin Lys Lys Ser He Glu Gin Pro Met Gin 
835 840 845 

Ala Gin Ala Ser Ala Gin Tyr Thr Asp Glu Ser Ala Gly Phe Arg His 
850 855 860 

Ala Asp Asn Tyr Phe Ser His Arg He Asn Lys Gly Pro Glu Asn Gly 
865 870 875 880 

Asn Cys Thr Leu Gin Tyr Glu Lys Gly Tyr Arg Leu Ser Tyr Ser Val 

885 890 895 

Ala Pro Ala His Tyr Asn Thr Tyr His Ala Arg Met Pro Asn Leu His 
900 905 910 

He Pro Asn His Thr Leu Arg Asp Pro Tyr Tyr His He Asn Asn Pro 
915 920 925 
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Val Ala Asn Arg Met His Ala Glu Tyr Glu Arg Asp Leu Val Asn Arg 
930 935 940 

Ser Ala Thr Leu Ser Pro Gin Arg Ser Ser Ser Arg Tyr Gin Glu Phe 
945 950 955 960 

Asn Tyr Ser Pro Gin lie Ser Arg Gin Leu His Pro Ser Glu lie Ala 

965 970 975 

Thr Thr Phe 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: CDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



GAATTCCCAG AGATGAACTC 


CTTGAGATTG 


TTTTAAATGA 


CTGCAGGTCT 


GGAAGGATTC 


60 


ACATTGCCAC ACTGTTTCTA 


GGCATGAAAA 


AACTGCAAGT 


TTCAACTTTG 


TTTTTGGTGC 


120 


AACTTTGATT CTTCAAGATG 


CTGCTTCTCT 


TCAGAGCCAT 


TCCAATGCTG 


CTGTTGGGAC 


180 


TGATGGTTTT ACAAACAGAC 


TGTGAAATTG 


CCCAGTACTA 


CATAGATGAA 


GAAGAACCCC 


240 


CTGGCACTGT AATTGCAGTG 


TTGTCACAAC 


ACTCCATATT 


TAACACTACA 


GATATACCTG 


300 


CAACCAATTT CCGTCTAATG 


AAGCAATTTA 


ATAATTCCCT 


TATCGGAGTC 


CGTGAGAGTG 


360 


ATGGGCAGCT GAGCATCATG 


GAGAGGATTG 


ACCGGGAGCA 


AATCTGCAGG 


CAGTCCCTTC 


420 


ACTGCAACCT GGCTTTGGAT GTGGTCAGCT 


TTTCCAAAGG 


ACACTTCAAG 


CTTCTGAACG 


480 


TGAAAGTGGA GGTGAGAGAC ATTAATGACC 


ATAGCCCTCA 


CTTTCCCAGT 


GAAATAATGC 


540 


ATGTGGAGGT GTCTGAAAGT 


TCCTCTGTGG 


GCACCAGGAT 


TCCTTTAGAA 


ATTGCAATAG 


600 


ATGAAGATGT TGGGTCCAAC 


TCCATCCAGA 


ACTTTCAGAT 


CTCAAATAAT 


AGCCACTTCA 


660 


GCATTGATGT GCTAACCAGA 


GCAGATGGGG 


TGAAATATGC 


AGATTTAGTC 


TTAATGAGAG 


720 


AACTGGACAG GGAAATCCAG 


CCAACATACA 


TAATGGAGCT 


ACTAGCAATG 


GATGGGGGTG 


780 
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TACCATCACT ATCTGGTACT GCAGTGGTTA 


ACATCCGAGT 


CCTGGACTTT 


AATGATAACA 


840 


GCCCAGTGTT TGAGAGAAGC ACCATTGCTG 


TGGACCTAGT 


AGAGGATGCT 


CCTCTGGGAT 


900 


ACCTTTTGTT GGAGTTACAT GCTACTGACG 


ATGATGAAGG 


AGTGAATGGA 


GAAATTGTTT 


960 


ATGGATTCAG CACTTTGGCA TCTCAAGAGG 


TACGTCAGCT 


ATTTAAAATT 


AACTCCAGAA 


1020 


CTGGCAGTGT TACTCTTGAA GGCCAAGTTG 


ATTTTGAGAC 


CAAGCAGACT 


TACGAATTTG 


1080 


AGGTACAAGC CCAAGATTTG GGCCCCAACC 


CACTGACTGC 


TACTTGTAAA 


GTAACTGTTC 


1140 


ATATACTTGA TGTAAATGAT AATACCCCAG 


CCATCACTAT 


TACCCCTCTG 


ACTACTGTAA 


1200 


ATGCAGGAGT TGCCTATATT CCAGAAACAG 


CCACAAAGGA 


GAACTTTATA 


GCTCTGATCA 


1260 


GCACTACTGA CAGAGCCTCT GGATCTAATG 


GACAAGTTCG 


CTGTACTCTT 


TATGGACATG 


1320 


AGCACTTTAA ACTACAGCAA GCTTATGAGG 


ACAGTTACAT 


GATAGTTACC 


ACCTCTACTT 


1380 


TAGACAGGGA AAACATAGCA GCGTACTCTT 


TGACAGTAGT 


TGCAGAAGAC 


CTTGGCTTCC 


1440 


CCTCATTGAA GACCAAAAAG TACTACACAG 


TCAAGGTTAG 


TGATGAGAAT 


GACAATGCAC 


1500 


CTGTATTTTC TAAACCCCAG TATGAAGCTT 


CTATTCTGGA 


AAATAATGCT 


CCAGGCTCTT 


1560 


ATATAACTAC AGTGATAGCC AGAGACTCTG 


ATAGTGATCA 


AAATGGCAAA 


GTAAATTACA 


1620 


GACTTGTGGA TGCAAAAGTG ATGGGCCAGT 


CACTAACAAC 


ATTTGTTTCT 


CTTGATGCGG 


1680 


ACTCTGGAGT ATTGAGAGCT GTTAGGTCTT 


TAGACTATGA 


AAAACTTAAA 


CAACTGGATT 


1740 


TTGAAATTGA AGCTGCAGAC AATGGGATCC 


CTCAACTCTC 


CACTCGCGTT 


CAACTAAATC 


1800 


TCAGAATAGT TGATCAAAAT GATAATTGCC 


CTGTGATAAC 


TAATCCTCTT 


CTTAATAATG 


1860 


GCTCGGGTGA AGTTCTGCTT CCCATCAGCG 


CTCCTCAAAA 


CTATTTAGTT 


TTCCAGCTCA 


1920 


AAGCCGAGGA TTCAGATGAA GGGCACAACT 


CCCAGCTGTT 


CTATACCATA 


CTGAGAGATC 


1980 


CAAGCAGATT GTTTGCCATT AACAAAGAAA 


GTGGTGAAGT 


GTTCCTGAAA 


AAACAATTAA 


2040 


ACTCTGACCA TTCAGAGGAC TTGAGCATAG 


TAGTTGCAGT 


GTATGACTTG 


GGAAGACCTT 


2100 


CATTATCCAC CAATGCTACA GTTAAATTCA 


TCCTCACCGA 


CTCTTTTCCT 


TCTAACGTTG 


2160 


AAGTCGTTAT TTTGCAACCA TCTGCAGAAG 


AGCAGCACCA 


GATCGATATG 


TCCATTATAT 


2220 


TCATTGCAGT GCTGGCTGGT GGTTGTGCTT 


TGCTACTTTT 


GGCCATCTTT 


TTTGTGGCCT 


2280 


GTACTTGTAA AAAGAAAGCT GGTGAATTTA 


AGCAGGTACC 


TGAACAACAT 


GGAACATGCA 


2340 
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TATPTV TV PTV TlPP 

A X oAAvjAALVj 


PpmpmrpTV » r»P 
tXXGX XAAGC 


tv f*r*c*o tv tptp 


P p p 7a p tp p p m 
ULLAGXuGGX 


UltJ. iti. Itx 


ip m n rn p m p Tv r» rn 


0 A C\ (\ 


r*rnr* tv r*TP tv tp 
C TGAG X (JA X G 


p/"^ TV * PTPT/"»P 


TV TP TV TV TTV Pipr 

ATCAATACTG 


TV Tv mp rn/^ tv r* Tv ta 
AATU TG AG AA 


1 1 vjUAbLvj 1 vj 


rnppfnpmTv TA pp 




» * f+7if*f* JlPPTV 

AAGAGCAGGA 


rpO 7V f' l /" , 7S TV TV I^TV 

TuAGCAAACA 


tv m TV TV T\ 

GGCATAAAGC 


tv ^rnr<o tv m/^ m/^ 

ACTLCATCTC 


rn r* tn ta r» r> ta m m 


T TA TP TA P TV p TV rn 
1 A 1 LAUAUA 1 




CTGGTTGGCA 


CCTGGACAAT 


TGTGCAATGA 


GCATAAGTGG 


tv o tv mm/n m/n tv r* 

ACATTCTCAC 


ta rnp/^ppp tv p ta 

ATGGGGUALA 


0 c O A 


fnm % tv tv H * 

TTAGTACAAA 


GGACAGTGGC 


TV TV TV m« m« 

AAAGGAGATA 


GTGACTTCAA 


TA ^ TV /"^ m/** TV /•* 

TGACAGTGAC 


mn»n/' TV rn tv P rn TA 


0 £ VI A 


GTGGAGAATC 


TVfTVTVTVTV^TVTvP 

ACAAAAGAAG 


AGCATTGAGC 


AGCCAATGCA 


GGC AC AAGC C 


TA P m P P nip TV TA rn 
Avj X X LAA X 


/ / uu 


TV OTV TV f+lK. ITV^TV 

ACACAGATGA 


ATCAGLAGGG 


TTCCGACATG 


o /"^ tv rn tv tv o m tv 

CCGATAACTA 


mmmo 7V rT 1 /"* Tv O 

TTTCAGuuAL 


PPTA7ArppTV7AP7A 

UVjAA X LAALA 


6 / OU 


AGGGTCCAGA 


iv tv t\ mrro tv tv p 
AAATGGGAAC 


TPr» tv p tv mmp 
TGCACATTGC 


TVTVmTVm/^TVTVTVTV 

AATATGAAAA 


GGGL T A X AG A 


prnprnpmrnTv prp 


Z QC U 


CTGTAGCTCC 


m/^/"^m/^ tv mm tv 

TGCTCATTAC 


tv TvTiv/"*^rpTv f^r* 

AATACCTAC C 


tv fno tv tv f* tv tv rn 
ATGCAAGAAT 


GGGTAAUL I G 


PTAPTAfPTAPPPTA 

L AL A X AUUvsA 


9 o on 


« /*^tv m% rx^yfy m 

ACCATACCCT 


TAGAGACCCT 


TATTACCATA 


% * m tv tv TT\r* 

TC AAT AATC C 


TGTTGCTAAT 


PPP TV rnppTv pp 

CGGA1 GUALl? 




CGGAATATGA 


tv tv ^*tv ^ tv nullum 

AAGAGATTTA 


f*TT\f* TV TV TV r> TV TV 

GTCAACAGAA 


GTGCAACGTT 


TV m/^mr»/^ TV r» 

ATCTCLGCAG 


tv P7a rppprnprn ta 
AUAXLvjXUXA 


jUUU 


GCAGATACCA 


AGAATTCAAT 


TACAGTCCGC 


iv/^Tvnnivm^ivTv^ 

AGATATCAAG 


tv o % ^nfnm/^ tv rn 

ACAGCTTCAT 


ppmrriPTv ptv tv tv 
CCTTCAGAAA 


oUbU 


TTGCTACAAC 


CTTTTAATCA 


TTAGGCATGC 


7LAGTGAGAAT 


TV > TV TiPPP 

GCACAAAGGC 


tv tv ^mp^rnrnrn A 

AAGTGCTTTA 


^ 1 OA 


GCATGAAAGC 


TAAATATATG 


GAGTCTCCCC 


TTTCCCTCTG 


ATGGATGGGG 


PPTV PTV PTV PTV P 

VjCjAGACACAv? 


Q1 OA 

J XoU 


GACAGTGCAT 


TATvTvmTimTiOTv^* 

AAATATACAG 


CTGCTTTCTA 


TTTGCATTTc 


AL 1 i \j\3\3 AA i 


Ti rn rn rn rn p rrt rn rp rp 
XXXXXVsXXXX 




TTTTACATAT 


TTATTTTTCC 


TGAATTGAAT 


GTGACATx G 1 


CLIVjIUAuU i 


ATaPTAPPTA&T 
AAt X AVjU AA 1 




» ta mppTVP TV 

TAAATvJCACA 


GACCTACAGT 


CAAATATTTG 


AV7VJV7UUWV 1 V3 


71 TV TV n TV r»r TA p TV 


TP TA P TP 7A tlfl 71 




CCTAAAGTGG 


CCTTTTTACT 


TTTAGCAGCT 


CCTGGGTCTG 


CCCTCTGTGT 


m tv TvTr»TvPPr*p 
TAATLAGCCu 


^ /i o n 


CTGGTCAAGT 


CCTGAGTAGG ATCATGGCGT 


TTTTATATGC 


ATCTCACCTA 


CTTTGGACGT 


3480 


GATTTACACA 


TAATAGGAAA CGCTTGGTTT 


CAGTGAAGTC 


TGTGTTGTAT 


ATATTCTGTT 


3540 


ATATACACGC 


ATTTTGTGTT 


TGTGTATATA 


TTTCAAGTCC 


ATTCAGATAT 


GTGTATATAG 


3600 


TGCAGACCTT 


GTAAATTAAA 


TATTCTGATA 


CTTTTTCCTC 


AATAAATATT 


TAAAT 


3655 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Val Cys Cys Gly Pro Gly Arg Met Leu Leu Gly Trp Ala Gly Leu 
15 10 15 

Leu Val Leu Ala Ala Leu Cys Leu Leu Gin Val Pro Gly Ala Gin Ala 
20 25 30 

Ala Ala Cys Glu Pro Val Arg lie Pro Leu Cys Lys Ser Leu Pro Trp 
35 40 45 

Asn Met Thr Lys Met Pro Asn His Leu His His Ser Thr Gin Ala Asn 
50 55 60 

Ala lie Leu Ala Met Glu Gin Phe Glu Gly Leu Leu Gly Thr His Cys 
65 70 75 80 

Ser Pro Asp Leu Leu Phe Phe Leu Cys Ala Met Tyr Ala Pro lie Cys 

85 90 95 

Thr lie Asp Phe Gin His Glu Pro lie Lys Pro Cys Lys Ser Val Cys 
100 105 110 

Glu Arg Ala Arg Gin Gly Cys Glu Pro lie Leu lie Lys Tyr Arg His 
115 120 125 

Ser Trp Pro Glu Ser Leu Ala Cys Asp Glu Leu Pro Val Tyr Asp Arg 
130 135 140 

Gly Val Cys lie Ser Pro Glu Ala lie Val Thr Ala Asp Gly Ala Asp 
145 150 155 160 

Phe Pro Met Asp Ser Ser Thr Gly His Cys Arg Gly Ala Ser Ser Glu 

165 170 175 

Arg Cys Lys Cys Lys Pro Val Arg Ala Thr Gin Lys Thr Tyr Phe Arg 
180 185 190 

Asn Asn Tyr Asn Tyr Val lie Arg Ala Lys Val Lys Glu Val Lys Met 
195 200 205 

Lys Cys His Asp Val Thr Ala Val Val Glu Val Lys Glu lie Leu Lys 
210 215 220 

Ala Ser Leu Val Asn lie Pro Arg Asp Thr val Asn Leu Tyr Thr Thr 
225 230 235 240 
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Ser Gly Cys Leu Cys Pro Pro Leu Thr Val Asn Glu Glu Tyr Val He 

245 250 255 

Met Gly Tyr Glu Asp Glu Glu Arg Ser Arg Leu Leu Leu Val Glu Gly 
260 265 270 

Ser He Ala Glu Lys Trp Lys Asp Arg Leu Gly Lys Lys Val Lys Arg 
275 280 285 

Trp Asp Met Lys Leu Arg His Leu Gly Leu Gly Lys Thr Asp Ala Ser 
290 295 300 

Asp Ser Thr Gin Asn Gin Lys Ser Gly Arg Asn Ser Asn Pro Arg Pro 
305 310 315 320 



Ala Arg Ser 



(2) INFORMATION FOR SEQ ID NO: 8; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

AAGCCTGGGA CCATGGTCTG CTGCGGCCCG GGACGGATGC TGCTAGGATG GGCCGGGTTG 60 

CTAGTCCTGG CTGCTCTCTG CCTGCTCCAG GTGCCCGGAG CTCAGGCTGC AGCCTGTGAG 120 

CCTGTCCGCA TCCCGCTGTG CAAGTCCCTT CCCTGGAACA TGACCAAGAT GCCCAACCAC 180 

CTGCACCACA GCACCCAGGC TAACGCCATC CTGGCCATGG AACAGTTCGA AGGGCTGCTG 240 

GGCACCCACT GCAGCCCGGA TCTTCTCTTC TTCCTCTGTG CAATGTACGC ACCCATTTGC 300 

ACCATCGACT TCCAGCACGA GCCCATCAAG CCCTGCAAGT CTGTGTGTGA GCGCGCCCGA 360 

CAGGGCTGCG AGCCCATTCT CATCAAGTAC CGCCACTCGT GGCCGGAAAG CTTGGCCTGC 420 

GACGAGCTGC CGGTGTACGA CCGCGGCGTG TGCATCTCTC CTGAGGCCAT CGTCACCGCG 480 

GACGGAGCGG ATTTTCCTAT GGATTCAAGT ACTGGACACT GCAGAGGGGC AAGCAGCGAA 540 

CGTTGCAAAT GTAAGCCTGT CAGAGCTACA CAGAAGACCT ATTTCCGGAA CAATTACAAC 600 
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TATGTCATCC GGGCTAAAGT 


TAAAGAGGTA 


AAGATGAAAT 


GTCATGATGT 


GACCGCCGTT 


660 


GTGGAAGTGA AGGAAATTCT AAAGGCATCA 


CTGGTAAACA 


TTCCAAGGGA 


CACCGTCAAT 


720 


CTTTATACCA CCTCTGGCTG 


CCTCTGTCCT 


CCACTTACTG 


TCAATGAGGA 


ATATGTCATC 


780 


ATGGGCTATG AAGACGAGGA 


ACGTTCCAGG 


TTACTCTTGG 


TAGAAGGCTC 


TATAGCTGAG 


840 


AAGTGGAAGG ATCGGCTTGG 


TAAGAAAGTC 


AAGCGCTGGG 


ATATGAAACT 


CCGACACCTT 


900 


GGACTGGGTA AAACTGATGC 


TAGCGATTCC 


ACTCAGAATC 


AGAAGTCTGG 


CAGGAACTCT 


960 


AATCCCCGGC CAGCACGCAG 


CTAAATCCTG 


AAATGTAAAA 


GGCCACACCC 


ACGGACTCCC 


1020 


TTCTAAGACT GGCGCTGGTG 


GACTAACAAA 


GGAAAACCGC 


ACAGTTGTGC 


TCGTGACCGA 


1080 


TTGTTTACCG CAGACACCGC 


GTGGCTACCG 


AAGTTACTTC 


CGGTCCCCTT 


TCTCCTGCTT 


1140 


CTTAATGGCG TGGGGTTAGA 


TCCTTTAATA 


TGTTATATAT 


TCTGTTTCAT 


CAATCACGTG 


1200 


GGGACTGTTC TTTTGCAACC 


AGAATAGTAA 


ATTAAATATG 


TTGATGCTAA 


GGTTTCTGTA 


1260 


CTGGACTCCC TGGGTTTAAT 


TTGGTGTTCT 


GTACCCTGAT 


TGAGAATGCA 


ATGTTTCATG 


1320 


TAAAGAGAGA ATCCTGGTCA TATCTCAAGA 


ACTAGATATT 


GCTGTAAGAC 


AGCCTCTGCT 


1380 


GCTGCGCTTA TAGTCTTGTG TTTGTATGCC 


TTTGTCCATT 


TCCCTCATGC 


TGTGAAAGTT 


1440 


ATACATGTTT ATAAAGGTAG AACGGCATTT 


TGAAATCAGA 


CACTGCACAA 


GCAGAGTAGC 


1500 


CCAACACCAG GAAGCATTTA 


TGAGGAAACG 


CCACACAGCA 


TGACTTATTT 


TCAAGATTGG 


1560 


CAGGCAGCAA AATAAATAGT GTTGGGAGCC 


AAGAAAAGAA 


TATTTTGCCT 


GGTTAAGGGG 


1620 


CACACTGGAA TCAGTAGCCC TTGAGCCATT AACAGCAGTG 


TTCTTCTGGC 


AAGTTTTTGA 


1680 


TTTGTTCATA AATGTATTCA CGAGCATTAG AGATGAACTT ATAACTAGAC ATCTGTTGTT 


1740 


ATCTCTATAG CTCTGCTTCC TTCTAAATCA AACCCATTGT 


TGGATGCTCC 


CTCTCCATTC 


1800 


ATAAATAAAT TTGGCTTGCT GTATTGGCCA 


GGAAAAGAAA 


GTATTAAAGT 


ATGCATGCAT 


1860 


GTGCACCAGG GTGTTATTTA ACAGAGGTAT 


GTAACTCTAT 


AAAAGACTAT 


AATTTACAGG 


1920 


ACACGGAAAT GTGCACATTT 


GTTTACTTTT 


TTTCTTCCTT 


TTGCTTTGGG 


CTTGTGATTT 


1980 


TGGTTTTTGG TGTGTTTATG TCTGTATTTT 


GGGGGGTGGG 


TAGGTTTAAG 


CCATTGCACA 


2040 


TTCAAGTTGA ACTAGATTAG AGTAGACTAG 


GCTCATTGGC 


CTAGACATTA 


TGATTTGAAT 


2100 
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TTGTGTTGTT TAATGCTCCA TCAAGATGTC TAATAAAAGG AATATGGTTG TCAACAGAGA 2160 
CGACAACAAC AACAAA 2176 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Met val Cys Gly Ser Pro Gly Gly Met Leu Leu Leu Arg Ala Gly Leu 
15 10 15 

Leu Ala Leu Ala Ala Leu Cys Leu Leu Arg Val Pro Gly Ala Arg Ala 
20 25 30 

Ala Ala Cys Glu Pro Val Arg He Pro Leu Cys Lys Ser Leu Pro Trp 
35 40 45 

Asn Met Thr Lys Met Pro Asn His Leu His His Ser Thr Gin Ala Asn 
50 55 60 

Ala He Leu Ala He Glu Gin Phe Glu Gly Leu Leu Gly Thr His Cys 
65 70 75 80 

Ser Pro Asp Leu Leu Phe Phe Leu Cys Ala Met Tyr Ala Pro He Cys 

85 90 95 

Thr He Asp Phe Gin His Glu Pro He Lys Pro Cys Lys Ser Val Cys 
100 105 110 

Glu Arg Ala Arg Gin Gly Cys Glu Pro He Leu He Lys Tyr Arg His 
115 120 125 

Ser Trp Pro Glu Asn Leu Ala Cys Glu Glu Leu Pro Val Tyr Asp Arg 
130 135 140 

Gly Val Cys lie Ser Pro Glu Ala He val Thr Ala Asp Gly Ala Asp 
145 150 155 160 

Phe Pro Met Asp Ser Ser Asn Gly Asn Cys Arg Gly Ala Ser Ser Glu 

165 170 175 
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Arg Cys Lys Cys Lys Pro lie Arg Ala Thr Gin Lys Thr Tyr Phe Arg 
180 185 190 

Asn Asn Tyr Asn Tyr Val lie Arg Ala Lys Val Lys Glu lie Lys Thr 
195 200 205 

Lys Cys His Asp val Thr Ala Val Val Glu Val Lys Glu lie Leu Lys 
210 215 220 

Ser Ser Leu Val Asn lie Pro Arg Asp Thr Val Asn Leu Tyr Thr Ser 
225 230 235 240 

Ser Gly Cys Leu Cys Pro Pro Leu Asn Val Asn Glu Glu Tyr lie He 

245 250 255 

Met Gly Tyr Glu Asp Glu Glu Arg Ser Arg Leu Leu Leu Val Glu Gly 
260 265 270 

Ser He Ala Glu Lys Trp Lys Asp Arg Leu Gly Lys Lys Val Lys Arg 
275 280 285 

Trp Asp Met Lys Leu Arg His Leu Gly Leu Ser Lys Ser Asp Ser Ser 
290 295 300 

Asn Ser Asp Ser Thr Gin Ser Gin Lys Ser Gly Arg Asn Ser Asn Pro 
305 310 315 320 

Arg Gin Ala Arg Asn 

325 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1893 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GGCGGAGCGG GCCTTTTGGC GTCCACTGCG CGGCTGCACC CTGCCCCATC TGCCGGGATC 60 

ATGGTCTGCG GCAGCCCGGG AGGGATGCTG CTGCTGCGGG CCGGGCTGCT TGCCCTGGCT 120 

GCTCTCTGCC TGCTCCGGGT GCCCGGGGCT CGGGCTGCAG CCTGTGAGCC CGTCCGCATC 180 

CCCCTGTGCA AGTCCCTGCC CTGGAACATG ACTAAGATGC CCAACCACCT GCACCACAGC 240 

[Page 18] 



ACTCAGGCCA ACGCCATCCT 


GGCCATCGAG 


p a pttpp ta a p. 

Livb X 1 Lvs AA\3 




P A P OP A P TP.P 


ouu 


AGCCCCGATC TGCTCTTCTT 


CCTCTGTGCC 


A X G X AGGGvjL 


P P 71 TP TflP A P 


P ATTP.APTTP 
LhX XIjAL lit 


0 D VJ 


CAGCACGAGC CCATCAAGCC 


CTGTAAGTCT 


uXuX owuAvjL 


PPPPPPP.PP A 


P.P.P.PTP.TP.AP. 
OV?VjL X var X VjAvj 


H £ U 


CCCATACTCA TCAAGTACCG CCACTCGTGG 


CCGGAGAACG 


TGGGLXGGGA 


PPTAPPTPPP A 
V?GAGL X btUn 


AQf\ 


GTGTACGACA GGGGCGTGTG 


CATCTCTCCC 


p tv r*r*s*r* tv tp r* 
GAGGCCATLG 


rnT7V PTPPPP7A 

T XAG XGGGGA 


PPP2v.PPTP.AT 


Dfk v 


TTTCCTATGG ATTCTAGTAA CGGAAACTGT 


ta ptv f*r*r*f*r* tv tv 
AGAGGGGGAA 


PP 7a PTP 7A 7A PP 

GG AG X GAAL vj 


PTPTA A ATP.T 
L X \3 X Annl Vja 


ouu 


AAGCCTATTA GAGCTACACA 


GAAGACCTAT 


x rGGGGAALA 


7ATTTAP TA APT A 
AX X ALAAL 1 A 


TPTP ATTPCP, 
X \3 X LA X 1 wVjrvj 


0 0 u 


GCTAAAGTTA AAGAGATAAA GACTAAGTGC 


p ta tptatptpa 
G A X G A ibl G A 


PTPPAPTAP.T 
U 1 VjLAl? X Avs X 


cc TA pptp. a a p- 

vjvjAVjvj X VjAAvj 


1 &\J 


GAGATTCTAA AGTCCTCTCT 


GGTAAACATT 


LGALovjVjALA 


PTPTPTA 7APPT 


P T TA T & P P A p. p 


/ 0 u 


TCTGGCTGCC TCTGCCCTCC 


ACTTAATGTT 


tv 7v rn 7\ r* A ta T 
AA 1 G AGG AA X 


tv rn tv rnr* 7A TP 7A T 


PP.P.PTATP.A TA 
oVjVjU X A X 0 AA 


q a n 
0 % u 


GATGAGGAAC GTTCCAGATT 


ACTCTTGGTG 


GAAGGCTC J; A 


T TA P P TP TA P TA TV 

X AGG X 0 AoAA 


PTPP A 7A PP. TAT 
\s X V?V7AAV7uAX 


y u vj 


CGACTCGGTA AAAAAGTTAA GCGCTGGGAT 


ATGAAGC1 TG 


PTPTATPTTPP 

GXGAXGX XVjG 


A PTP APT A A A 
AL X wAlsXAAA 


i7 o u 


AGTGATTCTA GCAATAGTGA 


TTCCACTCAG 


tv m /"> tv 0 TV tv 0 m 

AGTCAGAAGT 


rrpPPPTi PP TV TA 

CTGGCAGGAA 


P TP P TV TA P P P P 

GTGGAAGGGG 


i n 0 n 
x u 


CGGCAAGCAC GCAACTAAAT 


CCCGAAATAC 


TVTVTVTVTVOmTV TV O 

AaAAAGTaAC 


TV P TV pmpp TV PT 

AGAGXGGAGX 


TPPT7A TTTA TA P 

XGGXAX XAAb 


X Uo u 


ACTTACTTGC ATTGCTGGAC 


TAGCAAAGGA 


tv ta tv mm/"*» »nm 

AAATTGCACT 


TVTTPPTAPTA TP 

Al TGLAGAXG 


TA A rpTP T TV TT 

AXAX XGXAX X 


X x *± u 


GTTTACTATA AAAATCATGT 


GATAACTGAT 


rri Tv rri rrt tv f ipcripni 

XAX X AG X XLX 


PTTTPTPTTT 

oX X X w XL X X X 


TPPTTTPTP_P 
X VJVJX X ILlVjL 


X £ v v 


TTCTCTCTTC TCTCAACCCC 


TTTGTAATGG 


1 J. 1 vjvjVjvjVjC A 


P. A P TP TT A A P. 
Vj Aw ivil AAV? 


TATATTP,TfiA 


1260 

J. £0 V V 


GTTTTCTATT TCACTAATCA 


TGAGAAAAAC 


T P TT P T T rn rn 
IGi iUllllu 


PAATAATAAT 


A A ATT A A APA 
AAAX X AAAwA 


X J v 


TGCTGTTACC AGAGCCTCTT 


TGCTGAGTCT 


PP TV P TV TPTTTV 

CCAGAxGl TA 


AX X XAGX XXL 


TPP APPPPA A 


1 JOV 


TTGGGAATGC AATATTGGAT 


GAAAAGAGAG 


GTTTCTGGTA 


fPT/^TVPTVPTV TA TV 

TTCAlAGAAA 


PPTTVPTVTTA TP 

GGXAGA1A1G 


1 a a n 

X *t t U 


CCTTAAAACA TACTCTGCCG 


ATCTAATTAC 


iv /"* s\ m m tv mmm 

AGCCTTATTT 


TTGTATGCCT 


TTTPPPP TV rpm 

TTTGGGCAT1 


XDUU 


CTCCTCATGC TTAGAAAGTT 


CCAAATGTTT 


tv m TV 7v TV O T TV TV 

ATAAAGGTAA 


TV TATPPP APTT 

AA X oGL AVj X X 


TPA APTP A A A 
X VjAAo X L AAA 




TGTCACATAG GCAAAGCAAT 


CAAGCACCAG 


GAAGTGTTTA 


TGAGGAAACA 


ACACCCAAGA 


1620 


TGAATTATTT TTGAGACTGT 


CAGGAAGTAA 


AATAAATAGG 


AGCTTAAGAA 


AGAACATTTT 


1680 


GCCTGATTGA GAAGCACAAC 


TGAAACCAGT 


AGCCGCTGGG 


GTGTTAATGG 


TAGCATTCTT 


1740 


CTTTTGGCAA TACATTTGAT 


TTGTTCATGA 


ATATATTAAT 


CAGCATTAGA 


GAAATGAATT 


1800 
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ATAACTAGAC ATCTGCTGTT ATCACCATAG TTTTGTTTAA TTTGCTTCCT TTTAAATAAA I860 
CCCATTGGTG AAAGTCAAAA AAAAAAAAAA AAA 1893 
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